Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
661	Concurrent Hierarchical Reinforcement Learning Bhaskara Marthi, David Latham, Stuart Russell Carlos Guestrin Dept of Computer Science | Source URL: www.cs.berkeley.edu | Language: English | Date: 2004-09-02 13:55:11 | Functional languages, Lisp programming language, Common Lisp, Cross-platform software, Reinforcement learning, Markov decision process, Ordinal number, Function, Lisp, Software engineering, Computer programming, Computing
662	Approximately Efficient Online Mechanism Design David C. Parkes DEAS, Maxwell-Dworkin Harvard University | Source URL: www.eecs.harvard.edu | Language: English | Date: 2005-01-05 13:04:15 | Operations research, Science, Dynamic programming, Markov processes, Stochastic control, Reinforcement learning, Mechanism design, Markov decision process, Vickrey–Clarke–Groves auction, Statistics, Control theory, Game theory
663	Learning for stochastic dynamic programming Sylvain Gelly and Jérémie Mary and Olivier Teytaud ∗ IA-TAO, Lri, Bât. 490, | Source URL: eprints.pascal-network.org | Language: English | Date: 2006-11-07 05:40:56 | Operations research, Cybernetics, Mathematical optimization, Search algorithms, Reinforcement learning, Perceptron, Dynamic programming, Regression analysis, Support vector machine, Machine learning, Statistics, Artificial intelligence
664	From: AAAI-93 Proceedings. Copyright © 1993, AAAI (www.aaai.org). All rights reserved. Planning Thomas Wit | Source URL: aaai.org | Language: English | Date: 2006-01-09 21:10:32 | Control theory, Mathematical optimization, Equations, Reinforcement learning, Automated planning and scheduling, Anytime algorithm, Algorithm, Dynamic programming, Shortest path problem, Mathematics, Operations research, Applied mathematics
665	multi-bandit_techreport.dvi | Source URL: www.princeton.edu | Language: English | Date: 2011-10-26 19:05:07 | Stochastic optimization, Markov models, Reinforcement learning, Variance, Algorithm, Statistics, Machine learning, Multi-armed bandit
666	Copyright by Jefferson Provost 2007 The Dissertation Committee for Jefferson Provost | Source URL: ftp.cs.utexas.edu | Language: English | Date: 2007-08-17 16:29:06 | Reinforcement learning, Robotics, Mobile robot, Robot, Reinforcement, Action learning, E-learning, Behavior, Learning, Education, Behaviorism
667	Learning to Follow Navigational Directions Adam Vogel and Dan Jurafsky Department of Computer Science Stanford University {acvogel,jurafsky}@stanford.edu | Source URL: nlp.stanford.edu | Language: English | Date: 2010-05-17 17:59:35 | SARSA, Q-learning, Reinforcement learning, Temporal difference learning, Machine learning, Algorithm, Apprenticeship learning, Spatial memory, Artificial intelligence, Learning, Mathematics
668	Consistent exploration improves convergence of reinforcement learning on POMDPs Paul A. Crook Gillian Hayes | Source URL: homepages.inf.ed.ac.uk | Language: English | Date: 2007-07-04 12:19:49 | Stochastic control, SARSA, Markov models, Theoretical computer science, Reinforcement learning, Q-learning, Council on Environmental Quality, Temporal difference learning, Partially observable Markov decision process, Statistics, Markov processes, Dynamic programming
669	Planning in Models that Combine Memory with Predictive Representations of State | Source URL: aaai.org | Language: English | Date: 2006-01-11 01:12:22 | Stochastic control, Partially observable Markov decision process, Markov decision process, Markov model, Dynamical system, Reinforcement learning, Pruning, Linear programming, Constraint algorithm, Statistics, Dynamic programming, Markov processes
670	Multi-Bandit Best Arm Identification Victor Gabillon Mohammad Ghavamzadeh Alessandro Lazaric INRIA Lille - Nord Europe, Team SequeL | Source URL: www.princeton.edu | Language: English | Date: 2011-10-26 19:05:10 | Stochastic optimization, Markov models, Variance, Reinforcement learning, Statistics, Machine learning, Multi-armed bandit
